TITLE OF THE INVENTION

OPTIMIZED EXPRESSION OF HPV 52 L1 IN YEAST

FIELD OF THE INVENTION 
The present invention relates generally to the prevention and/or therapy of human
papillomavirus (HPV) infection. More specifically, the present invention relates to synthetic
polynucleotides encoding HPV 52 L1 protein, and to recombinant vectors and hosts comprising said
polynucleotides. This invention also relates to HPV 52 virus-like particles (VLPs), wherein the VLPs are
produced by expressing recombinant HPV 52 L1 or L1 + L2 in yeast cells, and to their use in vaccines
and pharmaceutical compositions for preventing and treating HPV infections.

BACKGROUND OF THE INVENTION 

There are more than 80 types of human papillomavirus (HPV), many of which have been
associated with a wide variety of biological phenotypes, from benign proliferative warts to malignant
carcinomas (for review, see McMurray et al., Int. J. Exp. Pathol. 82(1): 15-33 (2001)). HPV6 and
HPV11 are the types most commonly associated with benign warts, nonmalignant condyloma acuminata
and/or low-grade dysplasia of the genital or respiratory mucosa. HPV16 and HPV18 are the high-risk
types most frequently associated with in situ and invasive carcinomas of the cervix, vagina, vulva and
anal canal. More than 90% of cervical carcinomas are associated with infections of HPV16, HPV18 or
the less prevalent oncogenic types HPV31, -33, -45, -52 and -58 (Schiffman et al., J. Natl. Cancer Inst.
85(12): 958-64 (1993)). The observation that HPV DNA is detected in 90-100% of cervical cancers
provides strong epidemiological evidence that HPVs cause cervical carcinoma (see Bosch et al., J. Clin.
Pathol. 55: 244-265 (2002)).

Papillomaviruses are small (50-60 nm), nonenveloped, icosahedral DNA viruses that
encode up to eight early and two late genes. The open reading frames (ORFs) of the viral genomes are
designated E1 to E7, and L1 and L2, where "E" denotes early and "L" denotes late. L1 and L2 code for
virus capsid proteins, while the E genes are associated with functions such as viral replication and cellular
transformation.

The L1 protein is the major capsid protein and has a molecular weight of 55-60 kDa.
The L2 protein is the minor capsid protein. Immunological data suggest that most of the L2 protein is
internal to the L1 protein in the viral capsid. Both the L1 and L2 proteins are highly conserved among
different papillomaviruses.

Expression of the L1 protein or a combination of the L1 and L2 proteins in yeast, insect
cells, mammalian cells or bacteria leads to self-assembly of virus-like particles (VLPs) (for review, see
Schiller and Roden, in Papillomavirus Reviews: Current Research on Papillomaviruses; Lacey, ed.
Leeds, UK: Leeds Medical Information, pp 101-12 (1996)). VLPs are morphologically similar to
authentic virions and are capable of inducing high titres of neutralizing antibodies upon administration
into animals or humans. Because VLPs do not contain the potentially oncogenic viral genome, they
present a safe alternative to the use of live virus in HPV vaccine development (for review, see Schiller
and Hildesheim, J. Clin. Virol. 19: 67-74 (2000)). For this reason, the L1 and L2 genes have been
identified as immunological targets for the development of prophylactic and therapeutic vaccines for
HPV infection and disease.

HPV vaccine development and commercialization have been hindered by difficulties
associated with obtaining high expression levels of capsid proteins in successfully transformed host
organisms, limiting the production of purified protein. Therefore, despite the identification of wild-type
nucleotide sequences encoding HPV L1 proteins such as HPV 52 L1 protein, it would be highly desirable
to develop a readily renewable source of crude HPV L1 protein that utilizes HPV 52 L1-encoding
nucleotide sequences that are optimized for expression in the intended host cell. Additionally, it would
be useful to produce large quantities of HPV 52 L1 VLPs having the immunity-conferring properties of
the native proteins for use in vaccine development.

SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods to elicit or enhance immunity
to the protein products expressed by HPV 52 L1 genes. Specifically, the present invention provides
polynucleotides encoding HPV 52 L1 protein, wherein the polynucleotides have been codon-optimized
for high-level expression in a yeast cell. In alternative embodiments of the invention, the nucleotide
sequence of the polynucleotide is altered to eliminate transcription termination signals that are recognized
by yeast. The present invention further provides HPV 52 virus-like particles (VLPs), wherein said VLPs
are produced by expressing recombinant HPV 52 L1 or L1 + L2 in yeast cells, and discloses use of HPV
52 VLPs in immunogenic compositions and vaccines for the prevention and/or treatment of HPV disease
and HPV-associated cancer.

The present invention relates to synthetic DNA molecules encoding the HPV 52 L1
protein. The codons of the synthetic molecules are designed so as to use the codons preferred by a yeast
cell. In an alternative embodiment of the invention, the nucleotide sequence of the synthetic molecule is
altered to eliminate transcription termination signals that are recognized by yeast. The synthetic
molecules may be used as a source of HPV 52 L1 protein, which may self-assemble into VLPs. Said
VLPs may be used in a VLP-based vaccine.

An exemplary embodiment of the present invention comprises a synthetic nucleic acid
molecule which encodes the HPV 52 L1 protein as set forth in SEQ ID NO:2, said nucleic acid molecule
comprising a sequence of nucleotides that is codon-optimized for high-level expression in a yeast cell.

Also provided are recombinant vectors and recombinant host cells, both prokaryotic and
eukaryotic, which contain the nucleic acid molecules disclosed throughout this specification. In a
preferred embodiment of the present invention, the host cell is a yeast cell.

The present invention also relates to a process for expressing an HPV 52 L1 protein in a
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid encoding an HPV
52 L1 protein into a yeast host cell; and (b) culturing the yeast host cell under conditions which allow
expression of said HPV 52 L1 protein.

The present invention further relates to a process for expressing an HPV 52 L1 protein in
a recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid molecule
encoding an HPV 52 L1 protein into a yeast host cell, wherein the nucleic acid molecule is
codon-optimized for optimal expression in the yeast host cell; and (b) culturing the yeast host cell under
conditions which allow expression of said HPV 52 L1 protein.

In preferred embodiments, the nucleic acid molecule comprises a sequence of nucleotides
as set forth in SEQ ID NO:1 (designated herein "52 L1 R sequence").

This invention also relates to HPV 52 virus-like particles (VLPs) which are produced in
yeast cells, methods of producing HPV 52 VLPs, and methods of using HPV 52 VLPs.

In a preferred embodiment of the invention, the yeast is selected from the group
consisting of: Saccharomyces cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyveromyces fragilis,
Kluyveromyces lactis, and Schizosaccharomyces pombe.

Another aspect of this invention is an HPV 52 VLP, wherein the VLP is produced by
recombinant expression of HPV 52 L1 or HPV 52 L1 + L2 in a yeast cell.

Yet another aspect of this invention is an HPV 52 VLP which comprises an HPV 52 L1
protein produced by a codon-optimized HPV 52 L1 gene. In an exemplary embodiment of this aspect of
the invention, the codon-optimized HPV 52 L1 gene comprises a sequence of nucleotides as set forth in
SEQ ID NO:1.

This invention also provides a method for inducing an immune response in an animal 
comprising administering HPV 52 virus-like particles to the animal. In a preferred embodiment, the HPV 
52 VLPs are produced by a codon-optimized gene. 

Yet another aspect of this invention is a method of preventing or treating HPV-associated
cervical cancer comprising administering to a mammal a vaccine comprising HPV 52 VLPs. In a
preferred embodiment of this aspect of the invention, the HPV 52 VLPs are produced in yeast.

This invention also relates to a vaccine comprising HPV 52 virus-like particles (VLPs), 
wherein the HPV 52 VLPs are produced in yeast. 

In an alternative embodiment of this aspect of the invention, the vaccine further
comprises VLPs of at least one additional HPV type. The at least one additional HPV type may be any
HPV type of interest, including any HPV type described in the art or those subsequently identified. In a
preferred embodiment, the HPV type is a type that is associated with a clinical phenotype such as warts or
cervical cancer. In a further preferred embodiment, the at least one additional HPV type is selected from
the group consisting of: HPV6, HPV11, HPV16, HPV18, HPV31, HPV33, HPV35, HPV39, HPV45,
HPV51, HPV55, HPV56, HPV58, HPV59, and HPV68.

This invention also relates to pharmaceutical compositions comprising HPV 52 virus-like
particles, wherein the HPV 52 VLPs are produced in yeast. Further, this invention relates to
pharmaceutical compositions comprising HPV 52 VLPs and VLPs of at least one additional HPV type.
In a preferred embodiment, the at least one additional HPV type is selected from the group consisting of:
HPV6, HPV11, HPV16, HPV18, HPV31, HPV33, HPV35, HPV39, HPV45, HPV51, HPV55, HPV56,
HPV58, HPV59, and HPV68.

As used throughout the specification and in the appended claims, the singular forms "a,"
"an," and "the" include the plural reference unless the context clearly dictates otherwise.

As used throughout the specification and appended claims, the following definitions and
abbreviations apply:

The term "promoter" refers to a recognition site on a DNA strand to which the RNA
polymerase binds. The promoter forms an initiation complex with RNA polymerase to initiate and drive
transcriptional activity. The complex can be modified by activating sequences termed "enhancers" or
"upstream activating sequences" or inhibiting sequences termed "silencers".

The term "vector" refers to some means by which DNA fragments can be introduced into 
a host organism or host tissue. There are various types of vectors including plasmids, viruses (including 
adenovirus), bacteriophages and cosmids. 

The term "cassette" refers to a nucleotide or gene sequence that is to be expressed from a
vector, for example, the nucleotide or gene sequence encoding the HPV 52 L1 protein. In general, a
cassette comprises a gene sequence inserted into a vector which, in some embodiments, provides
regulatory sequences for expressing the nucleotide or gene sequence. In other embodiments, the
nucleotide or gene sequence provides the regulatory sequences for its expression. In further
embodiments, the vector provides some regulatory sequences and the nucleotide or gene sequence
provides other regulatory sequences. For example, the vector can provide a promoter for transcribing the
nucleotide or gene sequence and the nucleotide or gene sequence provides a transcription termination
sequence. The regulatory sequences which can be provided by the vector include, but are not limited to,
enhancers, transcription termination sequences, splice acceptor and donor sequences, introns, ribosome
binding sequences, and poly(A) addition sequences.

The designations "52 L1 wild-type sequence" and "52 L1 wt sequence" refer to the HPV
52 L1 sequence disclosed herein as SEQ ID NO:3. Although the HPV 52 L1 wild-type sequence has
been described previously, it is not uncommon to find minor sequence variations between DNAs obtained
from clinical isolates. Therefore, a representative HPV 52 L1 wild-type sequence was isolated from
clinical samples previously shown to contain HPV 52 DNA (see EXAMPLE 1). The HPV 52 L1
wild-type sequence was used as a reference sequence to compare the codon-optimized HPV 52 L1 sequences
disclosed herein (see FIGURE 1).

The designations "HPV 52 L1 R" and "52 L1 R" refer to an exemplary synthetic HPV 52
L1 nucleotide sequence (SEQ ID NO:1), disclosed herein, wherein the sequence was rebuilt so that it
comprises codons that are preferred for high-level expression by a yeast cell.

The term "effective amount" means that sufficient vaccine composition is introduced to
produce adequate levels of the polypeptide, so that an immune response results. One skilled in the art
recognizes that this level may vary.

A "conservative amino acid substitution" refers to the replacement of one amino acid
residue by another, chemically similar, amino acid residue. Examples of such conservative substitutions
are: substitution of one hydrophobic residue (isoleucine, leucine, valine, or methionine) for another;
substitution of one polar residue for another polar residue of the same charge (e.g., arginine for lysine;
glutamic acid for aspartic acid).

The term "mammalian" refers to any mammal, including a human being.

"VLP" or "VLPs" mean(s) virus-like particle or virus-like particles.

"Synthetic" means that the HPV 52 L1 gene was created so that it contains a sequence of
nucleotides that is not the same as the sequence of nucleotides present in the designated naturally
occurring wild-type HPV 52 L1 gene (52 L1 wt, SEQ ID NO:3). As stated above, synthetic molecules
are provided herein comprising a sequence of nucleotides comprising codons that are preferred for
expression by yeast cells. The synthetic molecules provided herein encode the same amino acid
sequences as the wild-type HPV 52 L1 gene (SEQ ID NO:2).

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 shows a sequence alignment comparing nucleotides that were altered in the
synthetic HPV 52 L1 gene of the present invention (SEQ ID NO:1, indicated as "52 L1 R") (see
EXAMPLE 2). The reference sequence is the 52 L1 wild-type sequence (SEQ ID NO:3, indicated as "52
L1 wt"; see EXAMPLE 1). Altered nucleotides are indicated at their corresponding location. Nucleotide
number is contained within the parentheses. Identical nucleotides in the 52 L1 rebuilt sequence are
indicated with dots.
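
As a rough illustration of the dot notation described for FIGURE 1, the following Python sketch
compares a rebuilt sequence with a reference of equal length, emitting a dot where the nucleotides
agree and the altered nucleotide where they differ. The function name `dot_alignment` and the
nine-nucleotide toy sequences are assumptions made for illustration only; they are not taken from
SEQ ID NO:1 or SEQ ID NO:3.

```python
def dot_alignment(reference, rebuilt):
    """Return the rebuilt sequence with '.' wherever it matches the reference."""
    if len(reference) != len(rebuilt):
        raise ValueError("sequences must have equal length for a gap-free comparison")
    return "".join("." if ref == alt else alt for ref, alt in zip(reference, rebuilt))


# Toy sequences (hypothetical, for illustration only).
toy_wt = "ATGGATCTG"
toy_rebuilt = "ATGGATTTA"

line = dot_alignment(toy_wt, toy_rebuilt)
altered = sum(1 for ch in line if ch != ".")
print(toy_wt)
print(line)
print("altered nucleotides:", altered)
```

In this toy case two of the nine positions are altered, so the second printed line carries two
letters and seven dots.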

FIGURE 2 shows the rebuilt synthetic HPV 52 L1 double-stranded nucleic acid (SEQ ID
NOs: 1 and 7) and single-code amino acid sequence (SEQ ID NO:2). Nucleotide number is indicated to
the left.

FIGURE 3 shows a Northern blot of HPV 52 L1 wt and HPV 52 L1 R transcripts (see
EXAMPLE 4). The blot was probed with a mixture of DNA probes generated against both the 52 L1 wt
and the 52 L1 R sequences. The arrow on the right indicates the predicted position of a full-length 52 L1
transcript. No transcripts of any length were detected in the 5 and 10 µg lanes of 52 L1 wt RNA.
Full-length transcripts are apparent for 52 L1 R in both the 5 and 10 µg lanes.

FIGURE 4 shows a Western blot of HPV 52 L1 wt (52 wt) and 52 L1 R (52R) proteins.
HPV 16 L1 was included as a reference (16). Ten, five and two and one-half micrograms of total yeast
protein extract were denatured and applied to a 10% SDS-PAGE gel. The protein was Western
transferred. HPV 52 L1 protein was detected on the resulting blot using a yeast-absorbed anti-trpE-HPV
31 L1 goat polyclonal antiserum which cross-reacts with HPV 52 L1 and HPV 16 L1. Molecular weight
markers are indicated in kDa on the left. The arrow indicates the position of the ~55 kDa HPV 52 L1
protein.

FIGURE 5 shows a representative sample of HPV 52 VLPs composed of HPV 52 L1 R
protein molecules, described herein, as visualized by transmission electron microscopy (see EXAMPLE
7). The diameter of the spherical particles in this crude sample ranged from 40 to 70 nm, with
some particles displaying a regular array of capsomers. The bar represents approximately 0.1 µm.

DETAILED DESCRIPTION OF THE INVENTION 

The majority of cervical carcinomas are associated with infections of specific oncogenic
types of human papillomavirus (HPV). The present invention relates to compositions and methods to
elicit or enhance immunity to the protein products expressed by genes of oncogenic HPV types.
Specifically, the present invention provides polynucleotides encoding HPV 52 L1, wherein the
polynucleotides are codon-optimized for high-level expression in yeast. The present invention also
provides HPV 52 virus-like particles (VLPs), which are produced in yeast, and discloses use of said
polynucleotides and VLPs in immunogenic compositions and vaccines for the prevention and/or
treatment of HPV-associated cancer.

A wild-type HPV 52 L1 nucleotide sequence has been reported (GenBank Accession #
NC_001592). The present invention provides synthetic DNA molecules encoding the HPV 52 L1 protein.
In one aspect of the invention, the synthetic molecules comprise a sequence of codons, wherein at least
some of the codons have been altered to use the codons preferred by a yeast cell for high-level
expression. In an alternative aspect of the invention, the nucleotide sequence of the synthetic molecule is
altered to eliminate transcription termination signals that are recognized by yeast. The synthetic
molecules may be used as a coding sequence for expression of HPV 52 L1 protein, which may
self-assemble into VLPs. Said VLPs may be used in a VLP-based vaccine to provide effective
immunoprophylaxis against papillomavirus infection through neutralizing antibody and cell-mediated
immunity. Such VLP-based vaccines may also be useful for treatment of already established HPV
infections.

Expression of HPV VLPs in yeast cells offers the advantages of being cost-effective and
easily adapted to large-scale growth in fermenters. In addition, the yeast genome can be readily altered to
ensure selection of recombinant, transformed yeast with increased growth and expression potential.
However, many HPV L1 proteins, including HPV 52 L1, are expressed at levels in yeast cells which are
lower than what is desirable for commercial scale-up (see EXAMPLE 2).

Accordingly, the present invention relates to HPV 52 L1 gene sequences that are
"optimized" for high-level expression in a yeast cellular environment.

A "triplet" codon of four possible nucleotide bases can exist in 64 variant forms (4 x 4 x 4).
Because these codons provide the message for only 20 different amino acids (as well as translation
initiation and termination), some amino acids can be coded for by more than one codon, a phenomenon
known as codon redundancy. For reasons not completely understood, alternative codons are not
uniformly present in the endogenous DNA of differing types of cells. Indeed, there appears to exist a
variable natural hierarchy or "preference" for certain codons in certain types of cells. As one example,
the amino acid leucine is specified by any of six DNA codons including CTA, CTC, CTG, CTT, TTA,
and TTG. Exhaustive analysis of genome codon use frequencies for microorganisms has revealed that
endogenous DNA of E. coli most commonly contains the CTG leucine-specifying codon, while the DNA
of yeasts and slime molds most commonly includes a TTA leucine-specifying codon. In view of this
hierarchy, it is generally believed that the likelihood of obtaining high levels of expression of a
leucine-rich polypeptide by an E. coli host will depend to some extent on the frequency of codon use. For
example, it is likely that a gene rich in TTA codons will be poorly expressed in E. coli, whereas a CTG
rich gene will probably be highly expressed in this host. Similarly, a preferred codon for expression of a
leucine-rich polypeptide in yeast host cells would be TTA.
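
The codon redundancy described above can be checked with a short Python sketch. It builds the
standard genetic code from its conventional compact form (codons enumerated in T, C, A, G order,
with "*" marking a stop codon) and lists the synonymous codons for a given amino acid. The names
`CODON_TABLE` and `synonymous_codons` are illustrative choices, not part of this specification.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order: TTT, TTC, TTA, TTG, TCT, ...; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_codons(amino_acid):
    """Return all DNA codons that specify the given one-letter amino acid."""
    return sorted(codon for codon, aa in CODON_TABLE.items() if aa == amino_acid)


total_codons = len(CODON_TABLE)
amino_acid_count = len(set(CODON_TABLE.values()) - {"*"})
leucine_codons = synonymous_codons("L")

print(total_codons, "codons encode", amino_acid_count, "amino acids plus stop")
print("leucine:", leucine_codons)
```

The sketch reports 64 codons for 20 amino acids and returns the same six leucine codons named in
the text. It says nothing about which codon a given host prefers; that requires codon usage data
for the host in question.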

The implications of codon preference phenomena on recombinant DNA techniques are
manifest, and the phenomenon may serve to explain many prior failures to achieve high expression levels
of exogenous genes in successfully transformed host organisms: a less "preferred" codon may be
repeatedly present in the inserted gene and the host cell machinery for expression may not operate as
efficiently. This phenomenon suggests that synthetic genes which have been designed to include a
projected host cell's preferred codons provide an optimal form of foreign genetic material for practice of
recombinant protein expression. Thus, one aspect of this invention is an HPV 52 L1 gene that is
codon-optimized for high-level expression in a yeast cell. In a preferred embodiment of this invention, it has
been found that the use of alternative codons encoding the same protein sequence may remove the
constraints on expression of HPV 52 L1 proteins by yeast cells.

In accordance with this invention, HPV 52 L1 gene segments were converted to
sequences having identical translated sequences but with alternative codon usage as described by Sharp
and Cowe (Synonymous Codon Usage in Saccharomyces cerevisiae. Yeast 7: 657-678 (1991)), which is
hereby incorporated by reference. The methodology generally consists of identifying codons in the
wild-type sequence that are not commonly associated with highly expressed yeast genes and replacing them
with optimal codons for high expression in yeast cells. The new gene sequence is then inspected for
undesired sequences generated by these codon replacements (e.g., "ATTTA" sequences, inadvertent
creation of intron splice recognition sites, unwanted restriction enzyme sites, high GC content, presence
of transcription termination signals that are recognized by yeast, etc.). Undesirable sequences are
eliminated by substitution of the existing codons with different codons coding for the same amino acid.
The synthetic gene segments are then tested for improved expression.
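
The first two steps of the methodology outlined above (synonymous codon replacement followed by
inspection for undesired sequences) can be sketched in Python as follows. This is a minimal
illustration under stated assumptions, not the procedure actually used to build SEQ ID NO:1. The
function names, the toy gene, and the `TOY_PREFERRED` table are hypothetical; only the leucine
entry (TTA) follows the example given in this specification, and the lysine entry is an arbitrary
placeholder rather than a value from Sharp and Cowe. A real design would supply a complete
preference table derived from host codon usage data.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate an in-frame coding sequence into one-letter amino acids."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna, preferred):
    """Replace each codon with the preferred synonymous codon, when one is listed."""
    if len(dna) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = (dna[i:i + 3] for i in range(0, len(dna), 3))
    recoded = "".join(preferred.get(CODON_TABLE[codon], codon) for codon in codons)
    if translate(recoded) != translate(dna):
        raise ValueError("preferred table changed the encoded protein")
    return recoded


def find_motifs(dna, motifs):
    """Return (motif, position) pairs for every occurrence, sorted by position."""
    hits = []
    for motif in motifs:
        start = dna.find(motif)
        while start != -1:
            hits.append((motif, start))
            start = dna.find(motif, start + 1)
    return sorted(hits, key=lambda hit: hit[1])


# Hypothetical inputs: toy gene encoding M-D-L-K-stop and a partial placeholder table.
TOY_PREFERRED = {"L": "TTA", "K": "AAG"}
toy_gene = "ATGGATCTGAAATAA"

recoded = recode(toy_gene, TOY_PREFERRED)
print(recoded, translate(recoded))
print(find_motifs(recoded, ["ATTTA"]))
```

In this toy case the replacement of CTG by TTA next to a GAT codon creates an "ATTTA" sequence
that was absent from the input, which is the kind of side effect the inspection step is meant to
catch. The sketch only flags such sites; choosing a different synonymous codon to remove them, as
the text describes, is left to the designer.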

The methods described above were used to create synthetic gene segments for HPV 52
L1, resulting in a gene comprising codons optimized for high-level expression. While the above
procedure provides a summary of our methodology for designing codon-optimized genes for use in HPV
vaccines, it is understood by one skilled in the art that similar vaccine efficacy or increased expression of
genes may be achieved by minor variations in the procedure or by minor variations in the sequence.

Accordingly, the present invention relates to a synthetic polynucleotide comprising a
sequence of nucleotides encoding an HPV 52 L1 protein, or a biologically active fragment or mutant
form of an HPV 52 L1 protein, the polynucleotide sequence comprising codons optimized for expression
in a yeast host cell. Said mutant forms of the HPV 52 L1 protein include, but are not limited to:
conservative amino acid substitutions, amino-terminal truncations, carboxy-terminal truncations,
deletions, or additions. Any such biologically active fragment and/or mutant will encode either a protein
or protein fragment which at least substantially mimics the immunological properties of the HPV 52 L1
protein as set forth in SEQ ID NO:2. The synthetic polynucleotides of the present invention encode
mRNA molecules that express a functional HPV 52 L1 protein so as to be useful in the development of a
therapeutic or prophylactic HPV vaccine.

One aspect of this invention is a codon-optimized nucleic acid molecule which encodes
the HPV 52 L1 protein as set forth in SEQ ID NO:2, said nucleic acid molecule comprising a sequence of
nucleotides that is codon-optimized for high-level expression in a yeast cell. In a preferred embodiment
of this aspect of the invention, the nucleic acid molecule comprises a sequence of nucleotides as set forth
in SEQ ID NO:1.

The present invention also relates to recombinant vectors and recombinant host cells,
both prokaryotic and eukaryotic, which contain the nucleic acid molecules disclosed throughout this
specification. In a preferred embodiment of this invention, the host cell is a yeast host cell.

The synthetic HPV 52 L1 DNA, functional equivalents thereof, and fragments thereof,
constructed through the methods described herein may be recombinantly expressed by molecular cloning
into an expression vector containing a suitable promoter and other appropriate transcription regulatory
elements. Said expression vector may be transferred into prokaryotic or eukaryotic host cells to produce
recombinant HPV 52 L1 protein. Techniques for such manipulations are fully described in the art
(Sambrook et al., Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory, Cold
Spring Harbor, New York, (1989); Current Protocols in Molecular Biology, Ausubel et al., Greene Pub.
Associates and Wiley-Interscience, New York (1988); Yeast Genetics: A Laboratory Course Manual,
Rose et al., Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, (1990), which are hereby
incorporated by reference in their entirety).

Thus, the present invention relates to a process for expressing an HPV 52 L1 protein in a
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid encoding an HPV
52 L1 protein into a yeast host cell; and (b) culturing the yeast host cell under conditions which allow
expression of said HPV 52 L1 protein.

The present invention further relates to a process for expressing an HPV 52 L1 protein in
a recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid encoding an HPV
52 L1 protein into a yeast host cell, wherein the nucleic acid molecule is codon-optimized for optimal
expression in the yeast host cell; and (b) culturing the yeast host cell under conditions which allow
expression of said HPV 52 L1 protein.

This invention further relates to a process for expressing an HPV 52 L1 protein in a
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid as set forth in SEQ
ID NO:1 into a yeast host cell; and (b) culturing the yeast host cell under conditions which allow
expression of said HPV 52 L1 protein.

The synthetic genes of the present invention can be assembled into an expression cassette
that comprises sequences designed to provide efficient expression of the HPV 52 L1 protein in the host
cell. The cassette preferably contains the synthetic gene, with related transcriptional and translational
control sequences operatively linked to it, such as a promoter, and termination sequences. In a preferred
embodiment, the promoter is the S. cerevisiae GAL1 promoter, although those skilled in the art will
recognize that any of a number of other known yeast promoters such as the GAL10, GAL7, ADH1, TDH3
or PGK promoters, or other eukaryotic gene promoters may be used. A preferred transcriptional
terminator is the S. cerevisiae ADH1 terminator, although other known transcriptional terminators may
also be used. The combination of GAL1 promoter and ADH1 terminator is particularly preferred.

Another aspect of this invention is an HPV 52 virus-like particle (VLP) produced by
recombinantly expressing the HPV 52 L1 or L1 + L2 genes in a yeast cell, methods of producing HPV 52
VLPs, and methods of using HPV 52 VLPs. VLPs can self-assemble when L1, the major capsid protein
of human and animal papillomaviruses, is expressed in yeast, insect cells, mammalian cells or bacteria
(for review, see Schiller and Roden, in Papillomavirus Reviews: Current Research on Papillomaviruses;
Lacey, ed. Leeds, UK: Leeds Medical Information, pp 101-12 (1996)). Morphologically indistinguishable HPV
VLPs can also be produced by expressing a combination of the L1 and L2 capsid proteins. VLPs are
composed of 72 pentamers of L1 in a T=7 icosahedral structure (Baker et al., Biophys. J. 60(6): 1445-56
(1991)).

VLPs are morphologically similar to authentic virions and are capable of inducing high
titres of neutralizing antibodies upon administration into an animal. Immunization of rabbits (Breitburd
et al., J. Virol. 69(6): 3959-63 (1995)) and dogs (Suzich et al., Proc. Natl. Acad. Sci. USA 92(25): 11553-
57 (1995)) with VLPs was shown to both induce neutralizing antibodies and protect against experimental
papillomavirus infection. Additionally, immunization of adult women with HPV 16 VLPs was shown to
protect against HPV 16 infection and HPV 16 cervical intraepithelial neoplasia (Koutsky et al., N. Engl. J.
Med. 347: 1645-51 (2002)). Because VLPs do not contain the potentially oncogenic viral genome and
can self-assemble when expressed from a single gene, they present a safe alternative to the use of live
virus in HPV vaccine development (for review, see Schiller and Hildesheim, J. Clin. Virol. 19: 67-74
(2000)).

Thus, the present invention relates to virus-like particles comprised of recombinant L1
protein or recombinant L1 + L2 proteins of HPV 52, wherein the recombinant protein is expressed in a
yeast cell.

As stated above, in a preferred embodiment of the invention, the HPV 52 VLPs are
produced in yeast. In a further preferred embodiment, the yeast is selected from the group consisting of:
Saccharomyces cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyveromyces fragilis, Kluyveromyces
lactis, and Schizosaccharomyces pombe.

Another aspect of this invention is an HPV 52 VLP which comprises an HPV 52 L1
protein produced by a codon-optimized HPV 52 L1 gene. In a preferred embodiment of this aspect of the
invention, the codon-optimized HPV 52 L1 gene comprises a sequence of nucleotides as set forth in SEQ
ID NO:1.

Yet another aspect of this invention is a method of producing HPV 52 VLPs, comprising:
(a) transforming yeast with a recombinant DNA molecule encoding HPV 52 L1 protein or HPV 52 L1 +
L2 proteins; (b) cultivating the transformed yeast under conditions that permit expression of the
recombinant DNA molecule to produce the recombinant HPV 52 protein; and (c) isolating the
recombinant HPV 52 protein to produce HPV 52 VLPs.

In a preferred embodiment of this aspect of the invention, the yeast is transformed with a
codon-optimized HPV 52 L1 gene to produce HPV 52 VLPs. In a particularly preferred embodiment, the
codon-optimized HPV 52 L1 gene comprises a sequence of nucleotides as set forth in SEQ ID NO:1.

This invention also provides a method for inducing an immune response in an animal
comprising administering HPV 52 virus-like particles to the animal. In a preferred embodiment, the HPV
52 VLPs are produced by recombinantly expressing a codon-optimized gene encoding HPV 52 L1 or
HPV 52 L1 + L2.

Yet another aspect of this invention is a method of preventing and/or treating
HPV-associated cervical cancer comprising administering to a mammal a vaccine comprising HPV 52 VLPs.
In a preferred embodiment of this aspect of the invention, the HPV 52 VLPs are produced in yeast.

This invention also relates to a vaccine comprising HPV 52 virus-like particles (VLPs).

In an alternative embodiment of this aspect of the invention, the vaccine further
comprises VLPs of at least one additional HPV type. In a preferred embodiment, the at least one
additional HPV type is selected from the group consisting of: HPV 6, HPV 11, HPV 16, HPV 18, HPV
31, HPV 33, HPV 35, HPV 39, HPV 45, HPV 51, HPV 55, HPV 56, HPV 58, HPV 59, and HPV 68.

In a preferred embodiment of this aspect of the invention, the vaccine further comprises 
HPV 16 VLPs. 

In another preferred embodiment of the invention, the vaccine further comprises HPV 16 
VLPs and HPV 18 VLPs. 

In yet another preferred embodiment of the invention, the vaccine further comprises HPV 
6 VLPs, HPV 11 VLPs, HPV 16 VLPs and HPV 18 VLPs. 

This invention also relates to pharmaceutical compositions comprising HPV 52 virus-like 
particles. Further, this invention relates to pharmaceutical compositions comprising HPV 52 VLPs and 
VLPs of at least one additional HPV type. In a preferred embodiment, the at least one additional HPV 
type is selected from the group consisting of: HPV 6, HPV 11, HPV 16, HPV 18, HPV 31, HPV 33, HPV 
35, HPV 39, HPV 45, HPV 51, HPV 55, HPV 56, HPV 58, HPV 59, and HPV 68. 

Vaccine compositions of the present invention may be used alone at appropriate dosages 
which allow for optimal inhibition of HPV 52 infection with minimal potential toxicity. In addition, co- 
administration or sequential administration of other agents may be desirable. 

The amount of virus-like particles to be introduced into a vaccine recipient will depend 
on the immunogenicity of the expressed gene product. In general, an immunologically or 
prophylactically effective dose of about 10 µg to 100 µg, and preferably about 20 µg to 60 µg of VLPs is 
administered directly into muscle tissue. Subcutaneous injection, intradermal introduction, impression 
through the skin, and other modes of administration such as intraperitoneal, intravenous, or inhalation 
delivery are also contemplated. It is also contemplated that booster vaccinations may be provided. 
Parenteral administration, such as intravenous, intramuscular, subcutaneous or other means of 
administration with adjuvants such as alum or Merck alum adjuvant, concurrently with or subsequent to 
parenteral introduction of the vaccine of this invention is also advantageous. 

All publications mentioned herein are incorporated by reference for the purpose of 
describing and disclosing methodologies and materials that might be used in connection with the present 
invention. Nothing herein is to be construed as an admission that the invention is not entitled to antedate 
such disclosure by virtue of prior invention. 

Having described preferred embodiments of the invention with reference to the 
accompanying drawings, it is to be understood that the invention is not limited to those precise 
embodiments, and that various changes and modifications may be effected therein by one skilled in the 
art without departing from the scope or spirit of the invention as defined in the appended claims. 

The following examples illustrate, but do not limit the invention. 

EXAMPLE 1 

Determination of a Representative HPV 52 L1 Sequence 

The HPV 52 L1 sequence has been described previously (Genbank Accession # 
NC_001592). It is not uncommon, however, to find minor sequence variations between DNAs obtained from 
clinical isolates. To determine a representative HPV 52 L1 wild-type sequence, DNA was isolated from 
three clinical samples previously shown to contain HPV 52 DNA. HPV 52 L1 sequences were amplified 
in a polymerase chain reaction (PCR) using Taq DNA polymerase and the following primers: 5' L1 
5'-ATGTCCGTGTGGCGGCCTAGT-3' (SEQ ID NO:4) and 3' 52 Bgl II 
5'-GAGATCTCAATTACACAAAGTG-3' (SEQ ID NO:5). The amplified products were electrophoresed on 
agarose gels and visualized by ethidium bromide staining. The ~1500 bp L1 bands were excised and 
DNA was purified using the Geneclean Spin Kit (Q-Bio Gene, Carlsbad, CA). The DNA was then ligated to 
the TA cloning vector, pCR2.1 (Invitrogen). TOP10F' E. coli cells were transformed with the ligation 
mixture and plated on LB agar with kanamycin plus IPTG and X-gal for blue/white colony selection. 
The plates were inverted and incubated for 16 hours at 37°C. 

Colony PCR was performed on five white colonies originating from each of the three 
clinical isolates amplified. 5' L1 and 3' 52 Bgl II primers were used in a two-step PCR in which the first 
step comprised 10 cycles of 96°C for 15 seconds (denaturing), 55°C for 30 seconds (annealing) and 68°C 
for 2 minutes (extension), and the second step comprised 35 cycles of an essentially similar program, 
except the annealing step was performed at 50°C for 30 seconds. PCR products were electrophoresed on 
agarose gels and visualized by ethidium bromide staining. Several colonies from each clinical isolate 
contained amplified products with ~1500 bp bands. The colonies were cultured in LB medium with 
kanamycin, shaking at 37°C for 16 hours. Minipreps were performed to extract the plasmid DNAs, which 
were digested with restriction endonucleases to demonstrate the presence of the L1 gene in the plasmid. 
The resulting restriction fragments were viewed by agarose gel electrophoresis and ethidium bromide 
staining. 

DNA sequencing was performed on plasmids containing cloned L1 inserts from each of 
the three clinical isolates. DNA and translated amino acid sequences were compared with one another 
and the previously published Genbank HPV 52 L1 sequences. Sequence analysis of the three clinical 
isolates revealed that no sequence was identical to the Genbank sequence (Accession No. NC_001592). 
The pCR2.1 HPV 52 L1 clone #2C was chosen to be the representative HPV 52 L1 sequence and is 
referred to herein as the "52 L1 wild-type sequence" (SEQ ID NO:3, see FIGURE 1). The sequence 
chosen as 52 L1 wild-type (wt) contained one point mutation when compared to the Genbank sequence, 
which consisted of a silent mutation at nucleotide 1308 (adenine to guanine). The amino acid sequence 
of the HPV 52 L1 wt sequence was identical to the 52 L1 Genbank sequence. 
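
The check that a point mutation is silent can be illustrated with a short Python sketch. This sketch is not part of the original disclosure: the function names are hypothetical, the standard genetic code is assumed, and the codon pair ACA/ACG is an inference from nucleotides 1306-1308 of the wild-type sequence in FIGURE 1 combined with the adenine-to-guanine change at position 1308.

```python
from itertools import product

# Standard genetic code, built in TCAG order (an assumption of this sketch).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codon_index(nt_position):
    """Map a 1-based nucleotide position to (codon number, position in codon)."""
    return (nt_position - 1) // 3 + 1, (nt_position - 1) % 3 + 1


def is_silent(codon_a, codon_b):
    """Return True when both codons encode the same amino acid."""
    return CODON_TABLE[codon_a] == CODON_TABLE[codon_b]


# Nucleotide 1308 falls on the third position of codon 436.
print(codon_index(1308))
# Inferred Genbank codon (ACA) versus clone #2C codon (ACG): both encode Thr.
print(is_silent("ACA", "ACG"))
```

A change at the third codon position often leaves the encoded amino acid unchanged, which is consistent with the identical translated sequences reported above.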

The HPV 52 L1 wild-type sequence was amplified using the 5' L1 Bgl II primer 
(5'-GAGATCTCACAAAACAAAATGTCCGTGTGGC-3' (SEQ ID NO:6)) and the 3' 52 
Bgl II primer described above to add Bgl II extensions. PCR was performed using Taq polymerase. The 
PCR product was electrophoresed on an agarose gel and visualized by ethidium bromide staining. The 
~1500 bp band was excised and DNA was purified using the Geneclean Spin kit (Q-Bio Gene, Carlsbad, 
CA). The PCR product was then ligated to the pCR2.1 vector and TOP10F' cells were transformed with 
the ligation mixture. White colonies were cultured in LB medium with kanamycin, shaking at 37°C for 
16 hours. Minipreps were performed to extract the plasmid DNA. The HPV 52 L1 gene was released 
from the vector sequences with Bgl II restriction endonuclease digestions. The digested DNA was 
subjected to agarose gel electrophoresis and viewed by ethidium bromide staining. The L1 band was 
purified using the Geneclean kit and ligated to a dephosphorylated, BamHI-digested pGAL110 vector. 
TOP10F' E. coli cells were transformed with the ligation mixture. To screen for the HPV 52 L1 insert in 
the correct orientation, plasmid DNA from the colonies was PCR-amplified. DNA sequencing was 
conducted to confirm the sequence and orientation of the inserts. The selected clone was named 
pGAL110-HPV 52 L1 #5. Maxiprep DNA from the selected clone was prepared. Saccharomyces 
cerevisiae cells were made competent by spheroplasting with glusulase and transformed with 
pGAL110-HPV 52 L1 #5. The yeast transformation mixture was plated in Leu- sorbitol top-agar on Leu- sorbitol 
plates and incubated inverted for 3-5 days at 30°C. Colonies were picked and streaked for isolation on 
Leu- sorbitol plates. Isolated colonies were subsequently grown in 5 ml of 5 X Leu- Ade- sorbitol with 
1.6% glucose and 4% galactose in rotating tube cultures at 30°C to induce HPV 52 L1 transcription and 
protein expression. 

EXAMPLE 2 

Yeast codon optimization 

Yeast-preferred codons have been described (Sharp, Paul M. and Cowe, Elizabeth. 
Synonymous Codon Usage in Saccharomyces cerevisiae. Yeast 7: 657-678 (1991)). Expression of the 
HPV 52 L1 wt protein was detectable; however, the level of transcription was very low and not detectable 
by Northern blot. It was postulated that premature transcription termination may be responsible for the 
low expression levels of the HPV 52 L1 gene. To increase transcription of this gene and ensure that 
full-length transcripts would be produced, the HPV 52 L1 gene was rebuilt utilizing yeast-preferred codons. 
The sequence was inspected for the presence of transcription termination signals that are recognized 
by yeast, and these sequences were eliminated by substitution with alternative codons, while preserving 
the same amino acid sequence. The rebuilt HPV 52 L1 sequence, which comprises yeast 
codon-optimized sequences, contained 379 nucleotide alterations compared to the HPV 52 L1 wt sequence. The 
resulting sequence is referred to herein as "52 L1 R" (R = rebuild, see FIGURE 1). The nucleotide 
alterations between the 52 L1 wt (SEQ ID NO:3) and 52 L1 R (SEQ ID NO:1) sequences are shown in 
FIGURE 1. The translated amino acid sequence of 52 L1 R was not altered (SEQ ID NO:2, see FIGURE 
2). The rebuilt sequence provides increased HPV 52 L1 protein expression, which is a significant 
advance over the wild-type for use in vaccine development. 
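
The principle of rebuilding a gene with preferred codons while preserving the amino acid sequence can be sketched in Python as follows. This is a minimal illustration only and not the procedure actually used: the function names and the one-codon-per-amino-acid table are assumptions made for this example, and the sketch does not screen for transcription termination signals. The input is the first 21 nucleotides of the 52 L1 wt sequence in FIGURE 1.

```python
from itertools import product

# Standard genetic code, built in TCAG order (an assumption of this sketch).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Illustrative table with a single preferred codon per amino acid.
# Hypothetical choice for this example; a real design may use several codons.
PREFERRED = {
    "A": "GCT", "C": "TGT", "D": "GAC", "E": "GAA", "F": "TTC",
    "G": "GGT", "H": "CAC", "I": "ATC", "K": "AAG", "L": "TTG",
    "M": "ATG", "N": "AAC", "P": "CCA", "Q": "CAA", "R": "AGA",
    "S": "TCC", "T": "ACT", "V": "GTC", "W": "TGG", "Y": "TAC",
    "*": "TAA",
}


def translate(dna):
    """Translate complete codons of a DNA string into one-letter amino acids."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def rebuild(dna):
    """Re-encode the translated protein using the preferred codon table."""
    return "".join(PREFERRED[aa] for aa in translate(dna))


def count_differences(seq_a, seq_b):
    """Count positions at which two equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(seq_a, seq_b))


wt_fragment = "ATGTCCGTGTGGCGGCCTAGT"  # nucleotides 1-21 of 52 L1 wt (FIGURE 1)
rebuilt = rebuild(wt_fragment)

# The protein must be unchanged even though the nucleotides differ.
assert translate(rebuilt) == translate(wt_fragment)
print(rebuilt, count_differences(wt_fragment, rebuilt))
```

Applied to a full-length gene, the same difference count is the kind of figure reported above as the number of nucleotide alterations between the wild-type and rebuilt sequences.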

The strategy employed to produce the optimized gene was to design long overlapping 
sense and antisense oligomers that span the gene, substituting nucleotides with yeast-preferred codon 
sequences while maintaining the amino acid sequence. These oligomers were used in place of template 
DNA in a PCR reaction with Pfu DNA polymerase. Additional amplification primers were designed and 
used to amplify the rebuilt sequences from template oligomers. 
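
The design of overlapping sense and antisense oligomers that span a gene can be sketched as follows. This is an illustration under stated assumptions, not the design actually used: the oligomer length and overlap are hypothetical parameters, the function names are invented for this example, and the test sequence is the first 50 nucleotides of 52 L1 R from FIGURE 2.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def design_oligos(seq, length=60, overlap=20):
    """Tile a sequence with overlapping oligomers on alternating strands.

    Returns a list of (start, strand, oligo) tuples, where start is the
    0-based offset of the fragment in the sense strand. Length and overlap
    are hypothetical defaults chosen for illustration.
    """
    if not 0 < overlap < length:
        raise ValueError("overlap must be positive and shorter than length")
    step = length - overlap
    oligos = []
    start = 0
    index = 0
    while True:
        fragment = seq[start:start + length]
        strand = "sense" if index % 2 == 0 else "antisense"
        oligo = fragment if strand == "sense" else reverse_complement(fragment)
        oligos.append((start, strand, oligo))
        if start + length >= len(seq):
            break  # the last fragment reaches the end of the gene
        start += step
        index += 1
    return oligos


gene_start = "ATGTCCGTCTGGAGACCATCCGAAGCTACTGTCTACTTGCCACCAGTTCC"
for start, strand, oligo in design_oligos(gene_start, length=20, overlap=8):
    print(start, strand, oligo)
```

Because neighboring oligomers lie on opposite strands and share an overlap, each can anneal to the next and be extended by the polymerase, which is why no separate template DNA is needed.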

The optimal conditions for amplification were section-specific; however, most reactions 
employed a program resembling 94°C for 5 minutes (denaturing) followed by 25 cycles of 95°C for 30 
sec (denaturing), 50-55°C for 30 sec (annealing), 72°C for 1.5 minutes (extension), followed by a 72°C 
final extension for 7 minutes and a 4°C hold. PCR products were examined by agarose gel electrophoresis. 
Bands of the appropriate size were excised and the DNA was purified from the gel slice. The amplified 
fragments were then used as templates to assemble the 1512 nt rebuilt HPV 52 L1 gene. 

Following rebuild, the 1512 nt band was gel purified, and ligated to pCR4 Blunt vector 
(Invitrogen, Carlsbad, CA). Following ligation, competent E. coli TOP10 cells were transformed with the 
ligation mixture. Colonies were grown in 4 ml LB with ampicillin and plasmid DNA was extracted from 
the colonies by miniprep techniques. The plasmid DNA was sequenced to confirm the presence of the 
desired HPV 52 L1 rebuild changes. To add BamHI extensions to both ends, the 52 L1 R (rebuild) was 
re-amplified from pCR4 Blunt-52 L1 R. The amplified fragment was cloned as above and the resulting 
plasmid DNA was sequenced. The plasmid, pCR4 Blunt-52 L1 R (Bam), was digested with BamHI and 
the resulting DNA fragment inserts were electrophoresed on an agarose gel. The ~1530 bp HPV 52 L1 R 
(Bam) fragment was gel purified and ligated to BamHI-digested pGAL110. TOP10F' E. coli (Invitrogen) 
cells were transformed with the ligation mixture. 

The resulting colonies were screened by PCR for the HPV 52 L1 R insert in the correct 
orientation. Sequence and orientation were confirmed by DNA sequencing. Maxiprep plasmid DNA was 
prepared. S. cerevisiae cells were made competent by spheroplasting and transformed. The yeast 
transformation was plated in Leu- sorbitol top-agar on Leu- sorbitol agar plates and incubated inverted 
for 7 days. Colonies were picked and streaked for clonal isolation on Leu- sorbitol agar plates. Isolated 
colonies were subsequently grown in 5 ml of 5 X Leu- Ade- sorbitol with 1.6% glucose and 4% galactose 
in rotating tube cultures at 30°C to induce L1 transcription and protein expression. After 48 and/or 72 
hours, a culture volume equivalent to an OD600 = 10 was pelleted, the supernatant was removed and the 
pellets were frozen and stored at -70°C. 

EXAMPLE 3 

RNA preparation 

Cell pellets of transformed yeast induced to express HPV 52 L1 by galactose induction 
were thawed on ice, suspended in 0.8 ml of Trizol reagent (Life Technologies, Gibco BRL) and incubated 
at room temperature for 5 minutes. One fifth volume of chloroform was added to the vial. It was then 
shaken vigorously for 15 seconds to mix and incubated at room temperature for 3 minutes. After a 5 
minute centrifugation at 13 k rpm, the upper phase was collected and transferred to a new vial. 0.4 ml 
isopropanol was added to the vial. The mixture was incubated at room temperature for 10 minutes. To 
pellet the RNA, centrifugation was performed at 13 k rpm for 10 minutes. The supernatant was 
decanted, the RNA pellet washed with 75% EtOH and the centrifugation step was repeated. The 
supernatant was decanted and the RNA pellet was allowed to air dry for 15 minutes followed by 
suspension in RNase-free water. Spectrophotometry was performed to determine the concentration of 
RNA in the sample using the assumption that an A260 reading of 1 = 40 µg/ml RNA when the A260/280 
is 1.7-2.0. 
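
The spectrophotometric estimate above amounts to a simple calculation, sketched here in Python. The function name and the dilution-factor parameter are assumptions added for illustration; the factor of 40 µg/ml per A260 unit and the 1.7-2.0 ratio window are taken from the text.

```python
def rna_concentration_ug_per_ml(a260, a280, dilution_factor=1.0):
    """Estimate RNA concentration from absorbance readings.

    Uses 1 A260 unit = 40 ug/ml RNA, valid when the A260/A280 ratio is
    between 1.7 and 2.0. The dilution factor is a hypothetical convenience
    for samples diluted before measurement.
    """
    ratio = a260 / a280
    if not 1.7 <= ratio <= 2.0:
        raise ValueError(f"A260/280 ratio {ratio:.2f} is outside 1.7-2.0")
    return a260 * 40.0 * dilution_factor


# Example: a 10-fold diluted sample reading A260 = 0.5 and A280 = 0.26.
print(rna_concentration_ug_per_ml(0.5, 0.26, dilution_factor=10))
```

The resulting concentration can then be used to work out the volume that contains the five or ten micrograms of RNA loaded in Example 4.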
EXAMPLE 4 

Northern Blot Analysis 

A 1.1% agarose formaldehyde gel was cast. Five and ten micrograms of RNA were 
combined with denaturing buffer (final concentrations: 6% formaldehyde, 50% formamide and 0.1 x 
MOPS) and heated to 65°C for 10 minutes. A one-tenth volume of gel loading buffer was added and the 
sample was loaded onto the gel. Electrophoresis was performed at 75 volts in 1 x MOPS buffer for ~3 
hours. The gel was washed for 60 minutes in 10 x SSC. 

The RNA was transferred to a Hybond-N+ nylon membrane (Amersham Biosciences, 
Piscataway, NJ) by capillary action over 16 hours in 10 x SSC. The RNA was then fixed to the nylon 
membrane by cross-linking using the Stratagene UV Stratalinker auto crosslink function (Stratagene, La 
Jolla, CA). After fixing, the nylon membrane was allowed to air dry. 

The Roche DIG High Prime DNA Labeling and Detection Kit I (Hoffmann-La Roche 
Ltd., Basel, Switzerland) was used to label 52 L1 wt and 52 L1 R DNA sequences with DIG to be used as 
probes to detect 52 L1 wt and 52 L1 R RNA on the Northern blot. The pre-hybridization, hybridization, 
and immunological development using an anti-DIG alkaline phosphatase-conjugated antibody were 
performed per the manufacturer's recommendations. Briefly, the blot was pre-hybridized at 37°C for 30 
minutes with gentle shaking. The probe was denatured by heating to 95°C for 5 minutes and subsequent 
quenching on ice. The probe was added to the hybridization solution and applied to the membrane for 4 
hours at 44.6°C with gentle shaking. The hybridization solution was then removed and the blot was 
washed 2 x for 5 minutes in 2 x SSC with 0.1% SDS at room temperature, followed by an additional 
wash at 65°C with 0.5 x SSC and 0.1% SDS. The blot was then blocked for 30 minutes and anti-DIG 
alkaline phosphatase-conjugated antibody was applied at a 1:5000 dilution for 30 minutes. The blot was 
washed and the presence of probe-bound RNA was determined by NBT/BCIP substrate detection of the 
alkaline phosphatase-conjugated anti-DIG bound antibody. 

Initial analysis of yeast expressing HPV 52 L1 wt suggested that HPV 52 L1 protein was 
expressed; however, the level was low. Northern blot analysis of RNA from yeast extracts of cultures 
induced to express HPV 52 L1 wt did not reveal any detectable HPV 52 L1 RNA. Since some protein of 
the appropriate size was detected, it was clear that some full-length RNA transcripts were made. The 
HPV 52 L1 gene was rebuilt with yeast-preferred codon sequences and was engineered to omit any 
possible premature transcription termination sites to ensure robust transcription. Northern blot analysis of 
the HPV 52 L1 R transcript revealed that full-length transcripts were generated and detectable by 
Northern blot analysis (FIGURE 3). 
EXAMPLE 5 

HPV 52 L1 Protein Expression 

Frozen yeast cell pellets of galactose-induced cultures equivalent to OD600 = 10 were 
thawed on ice and suspended in 300 µl of PC buffer (100 mM Na2HPO4 and 0.5 M NaCl, pH 7.0) with 
2 mM PMSF. Acid-washed 0.5 mm glass beads were added at a concentration of ~0.5 g/tube. The tubes 
were vortexed for 3 cycles of 5 minutes at 4°C with a 1 minute break. 7.5 µl of 20% Triton X-100 was 
added and the vortex step was repeated for 5 minutes at 4°C. The tubes were placed on ice for 15 
minutes, followed by centrifugation for 10 minutes at 4°C. The supernatant was transferred to a sterile 
microcentrifuge tube, which was labeled as total yeast protein extract, dated, and stored at -70°C. 

EXAMPLE 6 

Western Blot Analysis 

Total yeast protein extracts from twenty isolated yeast colonies for each HPV 52 L1 
construct were analyzed by Western blot to confirm expression of HPV 52 L1 protein after galactose 
induction. 

Ten, five, and two and one-half micrograms of total yeast protein extract were combined 
with SDS-PAGE loading buffer and heated to 95°C for 10 minutes. The HPV 16 L1 protein, which is 
approximately 55 kDa, was included as a positive control, along with HPV L1-free total yeast protein 
extract as a negative control (data not shown). The proteins were loaded onto a 10% SDS-PAGE gel and 
electrophoresed in Tris-Glycine buffer. After protein separation, the proteins were Western-transferred 
from the gel to nitrocellulose and the resulting blot was blocked in 1 x diluent buffer (Kirkegaard and 
Perry Laboratories, Gaithersburg, MD) for 1 hour at room temperature with rocking. The blot was 
washed three times and yeast-absorbed goat anti-trpE-HPV 31 L1 serum, which cross-reacts with HPV 16 
and HPV 52 L1 proteins, was applied at room temperature for 16 hours. The blot was then washed three 
times and incubated with a 1:2500 dilution of anti-goat-HRP conjugated antibody for 1 hr. The blot was 
again washed three times and NBT/BCIP detection substrate was applied (Kirkegaard and Perry 
Laboratories). Immunoreactive proteins were detected as purple bands on the blot. 

In all cases, the HPV 52 L1 protein was detected as a distinct immunoreactive band on 
the nitrocellulose corresponding to approximately 55 kDa (FIGURE 4). The intensity of the HPV 52 L1 
R band (2.5 µg lane) appeared to be significantly greater than the HPV 52 L1 wt band (10 µg lane). It was 
clear that upon rebuilding, the expression level of codon-optimized HPV 52 L1 R increased more than 
four-fold, which is the limit of direct comparison on the Western blot. 
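
The four-fold figure follows from the ratio of the protein loads being compared, as the following sketch shows. The function name is hypothetical, and the calculation assumes that band intensity scales with the amount of L1 protein in the lane, which is a simplification made for this example.

```python
def minimum_fold_increase(load_weaker_ug, load_stronger_ug):
    """Lower bound on the expression ratio between two constructs.

    If the construct loaded at load_stronger_ug gives a band at least as
    intense as the other construct loaded at load_weaker_ug, its expression
    per microgram of extract is at least load_weaker_ug / load_stronger_ug
    times higher (assuming intensity is proportional to protein amount).
    """
    if load_weaker_ug <= 0 or load_stronger_ug <= 0:
        raise ValueError("loads must be positive")
    return load_weaker_ug / load_stronger_ug


# 10 ug of 52 L1 wt extract compared with 2.5 ug of 52 L1 R extract.
print(minimum_fold_increase(10, 2.5))
```

Since 10 µg and 2.5 µg are the largest and smallest loads on the blot, four-fold is the largest ratio that can be bounded directly from these lanes.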

EXAMPLE 7 

Transmission Electron Microscopy 

To demonstrate that the 52 L1 protein was in fact self-assembling to form pentameric L1 
capsomers, which in turn self-assemble into virus-like particles, a partially purified HPV 52 L1 R protein 
extract was subjected to transmission electron microscopy (TEM). 

Yeast were grown under small scale fermentation and pelleted. The resulting pellets 
were subjected to purification treatments. Pellet and clarified yeast extracts were analyzed by 
immunoblot to demonstrate HPV 52 L1 protein expression and retention throughout the purification 
procedure. Clarified yeast extracts were then subjected to centrifugation over a 45%-sucrose cushion and 
the resulting pellet was suspended in buffer for analysis of HPV 52 L1 VLPs by TEM. 

A representative sample of the HPV 52 L1 R VLPs produced is shown in FIGURE 5. 
The diameter of the spherical particles in this crude sample ranged from 40 to 70 nm, with some 
particles displaying a regular array of capsomers. 






WHAT IS CLAIMED IS: 

1. A nucleic acid molecule comprising a sequence of nucleotides that encodes an 
HPV52 L1 protein as set forth in SEQ ID NO:2, the nucleic acid sequence being codon-optimized for 
high-level expression in a yeast cell. 

2. A vector comprising the nucleic acid molecule of claim 1. 

3. A host cell comprising the vector of claim 2. 

4. The host cell of claim 3, wherein the host cell is a yeast cell. 

5. The host cell of claim 4, wherein the yeast cell is selected from the group 
consisting of: Saccharomyces cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyveromyces fragilis, 
Kluyveromyces lactis, and Schizosaccharomyces pombe. 

6. The host cell of claim 4, wherein the host cell is Saccharomyces cerevisiae. 

7. The nucleic acid molecule of claim 1, wherein the sequence of nucleotides 
comprises a sequence of nucleotides as set forth in SEQ ID NO:1. 

8. Virus-like particles (VLPs) comprised of recombinant L1 protein or recombinant 
L1 + L2 proteins of HPV52, wherein the recombinant L1 protein or the recombinant L1 + L2 proteins are 
produced in yeast. 

9. The VLPs of claim 8, wherein the recombinant L1 protein or recombinant L1 + 
L2 proteins are encoded by a codon-optimized HPV52 L1 nucleic acid molecule. 

10. The VLPs of claim 9, wherein the codon-optimized nucleic acid molecule 
comprises a sequence of nucleotides as set forth in SEQ ID NO:1. 

11. A method of producing the VLPs of claim 9, comprising: 

(a) transforming yeast with a codon-optimized DNA molecule encoding 
HPV52 L1 protein or HPV52 L1 + L2 proteins; 

(b) cultivating the transformed yeast under conditions that permit expression 
of the codon-optimized DNA molecule to produce a recombinant 
papillomavirus protein; and 

(c) isolating the recombinant papillomavirus protein to produce the VLPs of 
claim 9. 

12. A vaccine comprising the VLPs of Claim 9. 

13. Pharmaceutical compositions comprising the VLPs of claim 9. 

14. A method of preventing HPV infection comprising administering the vaccine of 
Claim 12 to a mammal. 

15. A method for inducing an immune response in an animal comprising 
administering the VLPs of Claim 11 to an animal. 

16. The virus-like particles of Claim 9 wherein the yeast is selected from the group 
consisting of Saccharomyces cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyveromyces fragilis, 
Kluyveromyces lactis, and Schizosaccharomyces pombe. 



17. The virus-like particles of claim 16, wherein the yeast is Saccharomyces 
cerevisiae. 

18. The vaccine of claim 12, further comprising VLPs of at least one additional HPV 
type. 



19. The vaccine of claim 18 wherein the at least one additional HPV type is selected 
from the group consisting of: HPV6, HPV11, HPV16, HPV18, HPV31, HPV33, HPV35, HPV39, 
HPV45, HPV51, HPV55, HPV56, HPV58, HPV59, and HPV68. 

20. The vaccine of claim 19, wherein the at least one HPV type comprises HPV 16. 

21. The vaccine of claim 20, further comprising HPV18 VLPs. 

22. The vaccine of claim 21, further comprising HPV6 VLPs and HPV11 VLPs. 

23. The vaccine of claim 22, further comprising HPV31 VLPs. 

24. The vaccine of claim 21, further comprising HPV31 VLPs. 

25. The vaccine of claim 23, further comprising HPV45 VLPs. 

26. The vaccine of claim 24, further comprising HPV45 VLPs. 

27. The vaccine of claim 26, further comprising HPV58 VLPs. 

28. The vaccine of claim 25, further comprising HPV58 VLPs. 



ABSTRACT OF THE DISCLOSURE 

Synthetic DNA molecules encoding the HPV 52 L1 protein are provided. Specifically, 
the present invention provides polynucleotides encoding HPV 52 L1 protein, wherein said 
polynucleotides are codon-optimized for high level expression in a yeast cell. In alternative embodiments 
of the invention, the nucleotide sequence of the synthetic molecule is altered to eliminate transcription 
termination signals that are recognized by yeast. The synthetic molecules may be used to produce HPV 
52 virus-like particles (VLPs), and to produce vaccines and pharmaceutical compositions comprising the 
HPV 52 VLPs. The vaccines of the present invention provide effective immunoprophylaxis against 
papillomavirus infection through neutralizing antibody and cell-mediated immunity and may also be 
useful for treatment of existing HPV infections. 
FIGURE 1. HPV 52 L1 Nucleotide Sequence Alignment 



52 LI wt 
52 LI R 



1) ATGTCCGTGTGGCGGCCTAGTGAGGCCACTGTGTACCTGCCTCCTGTACC 
C...A-A.. ATCC .,A-,T C...T....A.,A,.T.. 



52 LI wt 
52 LI R 



51) TGTCTCTAAGGTTGTAAGCACTGATGAGTATGTGTCTCGCACAAGCATCT 
A CTCT. .C..C..A..C..C..CA.A. . CTC 



52 LI wt 
52 LI R 



101) ATTATTATGCAGGCAGTTCTCGATTACTAACAGTAGGACATCCCTATTTT 
.C.C.C.T. .TTCC. ..A....GT.G..T..C,.T..C..A,.C,.C 



52 LI wt 
52 LI R 



151) 



TCTATTAAAAACACCAGTAGTGGTAATGGTAAAAAAGTTTTAGTTCCCAA 
C. .G TCCTCC C G..G..C..G A., 



52 LI wt 
52 LI R 



201) 



GGTGTCTGGCCTGCAATACAGGGTATTTAGAATTAAATTGCCGGACCCTA 
...C TT A..C-.C C.-G A A. 



52 LI wt 
52 LI R 



251) ATAAATTTGGTTTTCCAGATACATCTTTTTATAACCCAGAAACCCAAAGG 
.C..G..C C C. .TAG. ..C..C T A 



52 LI wt 
52 LI R 



301) 



TTGGTGTGGGCCTGTACAGGCTTGGAAATTGGTAGGGGACAGCCTTTAGG 
C T T..T C A..T..A..A..G.. 



52 LI wt 
52 LI R 



351) 



TGTGGGTATTAGTGGGCATCCTTTATTAAACAAGTTTGATGATACTGAAA 
. . .C CTC. ..T..C..A..G..G C..C..C 



52 LI wt 
52 LI R 



401) 



CCAGTAACAAATATGCTGGTAAACCTGGTATAGATAATAGGGAATGTTTA 
. .TC G. .C G. .A C C. .A G 



52 LI wt 
52 LI R 



451) 



TCTATGGATTATAAGCAGACTCAGTTATGCATTTTAGGATGCAAACCTCC 
C..C A A..G..T..C..G..T..T..G..A.. 



52 LI wt 
52 LI R 



501) TATAGGTGAACATTGGGGTAAGGGAACCCCTTGTAATAATAATTCAGGAA 
A..C C T..T..A C..C..C..T..T. 



52 LI wt 
52 LI R 



551) ATCCTGGGGATTGTCCTCCCCTACAGCTCATTAACAGTGTAATACAGGAT 
.C..A..T..C.....A..AT.G..AT.G..C...TCC..C..C..A..C 



52 LI wt 
52 LI R 



601) 



GGGGACATGGTAGATACAGGATTTGGTTGCATGGATTTTAATACCTTGCA 
..T C..C..T..T..C T C..C..C 



52 LI wt 
52 LI R 



651) AGCTAGTAAAAGTGATGTGCCCATTGATATATGTAGCAGTGTATGTAAGT 
....TC...GTCC..C..C..A..C..C..C...TC.TC...C....... 


52 LI wt 
52 LI R 



701) ATCCAGATTATTTGCAAATGGCTAGCGAGCCATATGGTGACAGTTTGTTC 
.C C..C TCT. .A C TCC 



52 LI wt 
52 LI R 



751) 



TTTTTTCTTAGACGTGAGCAAATGTTTGTTAGACACTTTTTTAATAGGGC 
..C..CT.G...A,A..A C..C C..C..C..A.. 



52 LI wt 
52 LI R 



801) 



CGGTACCTTAGGTGACCCTGTGCCAGGTGATTTATATATACAAGGGTCTA 
T G A..T C..G..C..C T..C. 



52 LI wt 
52 LI R 



851) ACTCTGGCAATACTGCCACTGTACAAAGCAGTGCTTTTTTTCCTACTCCT 
T..C T C...TC.TC C..C..A A 



52 LI wt ( 901) AGTGGTTCTATGGTAACCTCAGAATCCCAATTATTTAATAAACCGTACTG 

52 LI R TC C C C G . . C . . C . . G . . A 

52 LI wt ( 951) GTTACAACGTGCGCAGGGCCACAATAATGGCATATGTTGGGGCAATCAGT 

52 LI R • . .G. . .A.A, .T. •A. .T C.,C..T..C T..C..A. 

52 LI wt (1001) TGTTTGTCACAGTTGTGGATACCACTCGTAGCACTAACATGACTTTATGT 

52 LI R ....C C. .C, .C. .0. -T. , .A.ATCT C..G... 

52 L1 wt (1051) GCTGAGGTTAAAAAGGAAAGCACATATAAAAATGAAAATTTTAAGGAATA 

52 LI R A..C..G TC...C..C..G..C C..C 

52 L1 wt (1101) CCTTCGTCATGGCGAGGAATTTGATTTACAATTTATTTTTCAATTGTGCA 

52 LI R .T.GA.A. .C. .T. .A C..C..G C..C..C T. 

52 LI wt (1151) AAATTACATTAACAGCTGATGTTATGACATACATTCATAAGATGGATGCC 

52 LI R .G. .C. .C. .G. .0 C..C T C..C C..T 

52 LI wt (12 01) ACTATTTTAGAGGACTGGCT^TTTGGCCTTACCCCACCACCGTCTGCATC 

52 LI R C..G..A C..TT.G..T A..C..T.. 

52 LI wt (1251) TTTGGAGGACACATACAGATTTGTCACTTCTACTGCTATAACTTGTCAAA 

52 LI R C A T C C C..C 

52 LI wt (1301) AAAACACGCCACCTAAAGGAAAGGAAGATCCTTTAAAGGACTATATGTTT 

52 LI R .G T A..G..T C.,A..G C C 

52 L1 wt (1351) TGGGAGGTGGATTTAAAAGAAAAGTTTTCTGCAGATTTAGATCAGTTTCC 

52 LI R A. .C. .C. .G. .G C T . . C . . G . . C. . A. . C . . 

52 LI wt (1401) TTTAGGTAGGAAGTTTTTGTTACAGGCAGGGCTACAGGCTAGGCCCAAAC 

52 LI R A..G A C G . . A . . T . . TT • G . . A A..A..GT 

52 L1 wt (1451) TAAAACGCCCTGCATCATCGGCCCCACGTACCTCCACAAAGAAGAAAAAG 

52 LI R .G. .GA.A. .A. .TAGC. .T. .T. . .A.A. .T C G... 

52 L1 wt (1501) GTTAAAAGGTAA (SEQ ID NO:3) 

52 LI R . .0. .G. .A. • . (SEQ ID N0:1) 



FIGURE 2. HPV 52 L1 R Nucleotide and Amino Acid Sequences. 

MSVW RPS EAT VYLP PVP 

1 ATGTCCGTCT GGAGACCATC CGAAGCTACT GTCTACTTGC CACCAGTTCC 

TACAGGCAGA CCTCTGGTAG GCTTCGATGA CAGATGAACG GTGGTCAAGG 

VSK VVST DEY VSR TSIY 

51 AGTCTCTAAG GTTGTCTCTA CCGACGAATA CGTCTCCAGA ACCTCCATCT 

TCAGAGATTC CAACAGAGAT GGCTGCTTAT GCAGAGGTCT TGGAGGTAGA 

YYA GSS RLLT VGH PYF 

101 ACTACTACGC TGGTTCCTCT AGATTGTTGA CTGTCGGTCA CCCATACTTC 

TGATGATGCG ACCAAGGAGA TCTAACAACT GACAGCCAGT GGGTATGAAG 

SIKN TSS GNG KKVL VPK 

151 TCTATCAAGA ACACCTCCTC CGGTAACGGT AAGAAGGTCT TGGTTCCAAA 

AGATAGTTCT TGTGGAGGAG GCCATTGCCA TTCTTCCAGA ACCAAGGTTT 

VSG LQYR VFR IKL PDPN 

201 GGTCTCTGGT TTGCAATACA GAGTCTTCAG AATCAAGTTG CCAGACCCAA 

CCAGAGACCA AACGTTATGT CTCAGAAGTC TTAGTTCAAC GGTCTGGGTT 

KFG FPD TSFY NPE TQR 

251 ACAAGTTCGG TTTCCCAGAC ACTAGTTTCT ACAACCCAGA AACTCAAAGA 

TGTTCAAGCC AAAGGGTCTG TGATCAAAGA TGTTGGGTCT TTGAGTTTCT 

LVWA CTG LEI GRGQ PLG 

301 TTGGTCTGGG CTTGTACTGG TTTGGAAATC GGTAGAGGTC AACCATTGGG 

AACCAGACCC GAACATGACC AAACCTTTAG CCATCTCCAG TTGGTAACCC 

VGI SGHP LLN KFD DTET 

351 TGTCGGTATC TCTGGTCACC CATTGTTGAA CAAGTTCGAC GACACTGAAA 

ACAGCCATAG AGACCAGTGG GTAACAACTT GTTCAAGCTG CTGTGACTTT 

SNK YAG KPGI DNR ECL 

401 CCTCTAACAA GTACGCTGGT AAGCCAGGTA TCGATAACAG AGAATGTTTG 

GGAGATTGTT CATGCGACCA TTCGGTCCAT AGCTATTGTC TCTTACAAAC 

SMDY KQT QLC ILGC KPP 

451 TCTATGGACT ACAAGCAAAC TCAATTGTGT ATCTTGGGTT GTAAGCCACC 

AGATACCTGA TGTTCGTTTG AGTTAACACA TAGAACCCAA CATTCGGTGG 

IGE HWGK GTP CNN NSGN 

501 AATCGGTGAA CACTGGGGTA AGGGTACTCC ATGTAACAAC AACTCTGGTA 

TTAGCCACTT GTGACCCCAT TCCCATGAGG TACATTGTTG TTGAGACCAT 

PGD CPP LQLI NSV IQD 

551 ACCCAGGTGA CTGTCCACCA TTGCAATTGA TCAACTCCGT CATCCAAGAC 

TGGGTCCACT GACAGGTGGT AACGTTAACT AGTTGAGGCA GTAGGTTCTG 

GDMV DTG FGC MDFN TLQ 

601 GGTGACATGG TCGACACTGG TTTCGGTTGT ATGGACTTCA ACACCTTGCA 

CCACTGTACC AGCTGTGACC AAAGCCAACA TACCTGAAGT TGTGGAACGT 

ASK SDVP IDI CSS VCKY 

651 AGCTTCTAAG TCCGACGTCC CAATCGACAT CTGTTCCTCT GTCTGTAAGT 

TCGAAGATTC AGGCTGCAGG GTTAGCTGTA GACAAGGAGA CAGACATTCA 

PDY LQM ASEP YGD SLF 

701 ACCCAGACTA CTTGCAAATG GCTTCTGAAC CATACGGTGA CTCCTTGTTC 

TGGGTCTGAT GAACGTTTAC CGAAGACTTG GTATGCCACT GAGGAACAAG 

FFLR REQ MFV RHFF NRA 

751 TTCTTCTTGA GAAGAGAACA AATGTTCGTC AGACACTTCT TCAACAGAGC 



AAGAAGAACT CTTCTCTTGT TTACAAGCAG TCTGTGAAGA AGTTGTCTCG 

GTL GDPV PGD LYI QGSN 

801 TGGTACCTTG GGTGACCCAG TTCCAGGTGA CTTGTACATC CAAGGTTCCA 

ACCATGGAAC CCACTGGGTC AAGGTCCACT GAACATGTAG GTTCCAAGGT 

SGN TAT VQSS AFF PTP 

851 ACTCTGGTAA CACTGCTACT GTCCAATCCT CTGCTTTCTT CCCAACTCCA 

TGAGACCATT GTGACGATGA CAGGTTAGGA GACGAAAGAA GGGTTGAGGT 

SGSM VTS ESQ LFNK PYW 

901 TCTGGTTCCA TGGTCACCTC CGAATCCCAA TTGTTCAACA AGCCATACTG 

AGACCAAGGT ACCAGTGGAG GCTTAGGGTT AACAAGTTGT TCGGTATGAC 

LQR AQGH NNG ICW GNQL 

951 GTTGCAAAGA GCTCAAGGTC ACAACAACGG TATCTGTTGG GGTAACCAAT 

CAACGTTTCT CGAGTTCCAG TGTTGTTGCC ATAGACAACC CCATTGGTTA 

FVT VVD TTRS TNM TLC 

1001 TGTTCGTCAC CGTCGTCGAC ACTACTAGAT CTACTAACAT GACCTTGTGT 

ACAAGCAGTG GCAGCAGCTG TGATGATCTA GATGATTGTA CTGGAACACA 

AEVK KES TYK NENF KEY 

1051 GCTGAAGTCA AGAAGGAATC CACCTACAAG AACGAAAACT TCAAGGAATA 

CGACTTCAGT TCTTCCTTAG GTGGATGTTC TTGCTTTTGA AGTTCCTTAT 

LRH GEEF DLQ FIF QLCK 

1101 CTTGAGACAC GGTGAAGAAT TCGACTTGCA ATTCATCTTC CAATTGTGTA 

GAACTCTGTG CCACTTCTTA AGCTGAACGT TAAGTAGAAG GTTAACACAT 

ITL TAD VMTY IHK MDA 

1151 AGATCACCTT GACCGCTGAC GTCATGACTT ACATCCACAA GATGGACGCT 

TCTAGTGGAA CTGGCGACTG CAGTACTGAA TGTAGGTGTT CTACCTGCGA 

TILE DWQ FGL TPPP SAS 

1201 ACTATCTTGG AAGACTGGCA ATTCGGTTTG ACTCCACCAC CATCCGCTTC 

TGATAGAACC TTCTGACCGT TAAGCCAAAC TGAGGTGGTG GTAGGCGAAG 

LED TYRF VTS TAI TCQK 

1251 CTTGGAAGAC ACTTACAGAT TCGTCACTTC CACTGCTATC ACCTGTCAAA 

GAACCTTCTG TGAATGTCTA AGCAGTGAAG GTGACGATAG TGGACAGTTT 

NTP PKG KEDP LKD YMF 

1301 AGAACACTCC ACCAAAGGGT AAGGAAGACC CATTGAAGGA CTACATGTTC 

TCTTGTGAGG TGGTTTCCCA TTCCTTCTGG GTAACTTCCT GATGTACAAG 

WEVD LKE KFS ADLD QFP 

1351 TGGGAAGTCG ACTTGAAGGA AAAGTTCTCT GCTGACTTGG ACCAATTCCC 

ACCCTTCAGC TGAACTTCCT TTTCAAGAGA CGACTGAACC TGGTTAAGGG 

LGR KFLL QAG LQA RPKL 

1401 ATTGGGTAGA AAGTTCTTGT TGCAAGCTGG TTTGCAAGCT AGACCAAAGT 

TAACCCATCT TTCAAGAACA ACGTTCGACC AAACGTTCGA TCTGGTTTCA 

KRP ASS APRT STK KKK 

1451 TGAAGAGACC AGCTAGCTCT GCTCCAAGAA CTTCCACCAA GAAGAAGAAG 

ACTTCTCTGG TCGATCGAGA CGAGGTTCTT GAAGGTGGTT CTTCTTCTTC 

V K R * (SEQ ID NO:2) 

1501 GTCAAGAGAT AA (SEQ ID NO:1) 

CAGTTCTCTA TT (SEQ ID NO:7) 



FIGURE 3. Expression of HPV 52 L1 wt and 52 L1 R Transcripts 

FIGURE 4. Western Blot Analysis of HPV 52 L1 Protein 

FIGURE 5. Transmission EM of VLPs Composed of HPV 52 L1 R Protein Molecules 




